NoSym: Non-Symbolic Databases for Data Decoupling
نویسنده
چکیده
Under the Unique Name Assumption (UNA), users need to have shared agreements on signifiers to use in schema or data, e.g. to use “genre” and not “type” to refer to a movie’s category. Agreements are difficult in open environments such as datasets on the web, open data, and crowd-sourced databases, thus this assumption can be invalid. Schema matching and data integration can be limited in responding to this problem [2] as: (1) schemas might not be available a priori with schema-less data sources and queries becoming more common; (2) dataset-level schema/data mappings limit a user’s ability to provide a contextual interpretation of a signifier suitable for a specific query-data matching task; and (3) data integration typically has an overhead which hinders the availability and low latency of databases.
منابع مشابه
A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining
Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...
متن کاملBeyond the Dichotomy of Symbolic versus Substantive Actions: Evidence from Corporate Environmental Management
The symbolic management literature explores loose coupling between substantive and symbolic aspects of organizational activities. The prior literature, however, focuses on the benefits of symbolic management and tends to treat it as costless. If symbolic management is costless, presumably all firms should pursue it, yet in practice they do not. In this paper, we extend the theory of symbolic ma...
متن کاملFoundations of Data Mining and knowledge Discovery
This paper discusses a view to capture discovery as a translation from non-symbolic to symbolic representation. First, a relation between symbolic processing and non-symbolic processing is discussed. An intermediate form was introduced to represent both of them in the same framework and clarify the difference of these two. Characteristic of symbolic representation is to eliminate quantitative m...
متن کاملTesting Database Programs using Relational Symbolic Execution
Symbolic execution is a technique which allows to automatically generate test inputs (and outputs) exercising a set of execution paths within a program to be tested. If the paths cover a sufficient part of the code under test, the test data offer a representative view of the program’s actual behaviour, allowing to detect failures and correct faults. Relational databases are ubiquitous in softwa...
متن کاملChromoViz: multimodal visualization of gene expression data onto chromosomes using scalable vector graphics
SUMMARY ChromoViz is an R package for the visualization of microarray gene expression data, cross-species and cross-platform comparisons, as well as non-expression genomic data obtained from public databases onto chromosomes. Chromosomal visualization format is proposed for the clear decoupling of the data layer from the procedure layer and the combined visualization of genomic data from hetero...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017